Improved voice activity detection based on a smoothed statistical likelihood ratio

نویسندگان

  • Yong Duk Cho
  • Khaldoon Al-Naimi
  • Ahmet M. Kondoz
چکیده

This paper presents the behavioural mechanism of a statistical modelbased voice activity detector (VAD), featuring a likelihood ratio test for the activity decision. From investigation of the VAD, it is found that detection errors could occur frequently at speech offset regions because of the delay term in the decision-directed parameter estimator, employed for the estimation of an unknown parameter of the likelihood ratio. Hence, this paper proposes a smoothed likelihood ratio so as to alleviate the detection errors at the offset region. Objective test results show that the proposed scheme is useful for achieving a considerable performance improvement for the VAD. Additionally, the proposed VAD gives detection performances superior to G.729B VAD and comparable with AMR VAD option 2.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust statistical voice activity detection using a likelihood ratio sign test

Voice activity detection (VAD) plays an important role on the performance of speech processing systems. Recently, more and more works focused on the statistical model-based VAD algorithms have been presented in literature, which make a decision of speech and nonspeech based on the likelihood ratio (LR). However, all the statistical models used in those algorithms are unable to exactly describe ...

متن کامل

Voice activity detection based on statistical models and machine learning approaches

The voice activity detectors (VADs) based on statistical models have shown impressive performances especially when fairly precise statistical models are employed. Moreover, the accuracy of the VAD utilizing statistical models can be significantly improved when machine-learning techniques are adopted to provide prior knowledge for speech characteristics. In the first part of this paper, we intro...

متن کامل

Selection of Reliable Likelihood Ratios for Statistical Model-Based Voice Activity Detection

A statistical model-based voice activity detection (VAD) is a robust algorithm in noisy condition to detect speech region from input signal by speech and non-speech statistical model such as complex Gaussian probability density function (PDF). The decision rule used in this VAD is based on Bayes’ rule and considers likelihood ratios (LRs) in whole frequency region. In this VAD, however, the Bay...

متن کامل

Likelihood ratio test with complex laplacian model for voice activity detection

This paper proposes a voice activity detector (VAD) based on the complex Laplacian model. With the use of a goodness-of-fit (GOF) test, it is discovered that the Laplacian model is more suitable to describe noisy speech distribution than the conventional Gaussian model. The likelihood ratio (LR) based on the Laplacian model is computed and then applied to the VAD operation. According to the exp...

متن کامل

Voice activity detection based on a family of parametric distributions

In this letter, generalized gamma distribution (GCD) is introduced as a new statistical model of spectral distribution to be applied to the likelihood ratio test performed in voice activity detection (VAD). A gradient-based on-line algorithm is proposed to estimate the parameters of GCD according to the maximum likelihood criterion. Experimental results show that the VAD algorithm implemented b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001